Organizing Digital Libraries by Automated Text Categorization∗

Authors

  • Henri Avancini
  • Andreas Rauber
  • Fabrizio Sebastiani
Abstract

Text Categorization (TC) is the discipline concerned with the construction of automatic text classifiers, i.e. programs capable of assigning to a document one or more among a set of ...

∗ This is an extended version of an invited paper presented by the third author at the Workshop on Artificial Intelligence for Cultural Heritage and Digital Libraries, co-located with the 7th Conference of the Italian Association for Artificial Intelligence, Bari, IT, September 2001.

Similar resources

AUTOMATED CATEGORIZATION SCHEME FOR DIGITAL LIBRARIES IN DISTANCE LEARNING: A Pattern Recognition Approach

Digital libraries play a crucial role in distance learning. Nowadays, they are one of the fundamental information sources for the students enrolled in this learning system. These libraries contain a huge amount of instructional data (text, audio and video) offered by the distance learning program. Organization of the digital libraries is therefore very important for easy and fast access to the de...

Organizing and Using Digital Libraries by Automated Text Categorization

When it was proclaimed that the Library contained all books, the first impression was one of extravagant happiness. All men felt themselves to be the masters of an intact and secret treasure. There was no personal or world problem whose eloquent solution did not exist in some hexagon. (. . . ) As was natural, this inordinate hope was followed by an excessive depression. The certitude that some ...

A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation

The rapid proliferation of textual and multimedia online databases, digital libraries, Internet servers, and intranet services has turned researchers' and practitioners' dream of creating an information-rich society into a nightmare of info-gluts. Many researchers believe that turning an info-glut into a useful digital library requires automated techniques for organizing and categorizing large-...

A Novel Text Categorization Approach based on K-means and Support Vector Machine

With the continuous expansion of digital libraries and online news, a huge amount of text documents now exists on the web. Consequently, there is a need to organize them. Text categorization is an active field of analysis that can be used for organizing text documents. Text categorization is the process of assigning documents to predefined categories that are associated with their content. CAWP algorithm is de...

Publication date: 2002